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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In Re Application of: 

David J. McDonnell, et al. 

Application No: 09/186,046 

Filed: November 03, 1998 

For: Cross-Clock Domain Data Transfer 
Method and Apparatus 



Examiner: Unknown 
Art Unit: 2816 



Assistant Commissioner for Patents 
Washington, D.C 20231 

PRELIMINARY AMENDMENT 

Dear Sir: 

Applicants respectfully request that this Preliminary Amendment be entered prior 
to examination of the above-identified patent application. 
IN THE SPECIFICATION 

Please amend the specification as follows: 

Page 1, line 3 before "The" insert -This is a continuation of U.S. Patent 
Application Serial Number: 09/186,046, filed on November 03, 1998, and entitled Cross- 
Clock Domain Data Transfer Method and Apparatus.— 
IN THE CLAIMS 



Please cancel claims 1-12 without prejudice. 
Please add the following new claims: 



13. (New) An apparatus, comprising: 



2 a) a host side memory controller region having a memory access 

3 request input and a memory command packet chunk output, a memory 

4 command packet chunk being a portion of a memory command packet, said host 

5 side memory controller region clocked by a first clock; and 

6 b) a memory side memory controller region having a memory 

7 command packet chunk input coupled to said memory command packet chunk 

8 output said memory side memory controller region clocked by a second clock, 

9 said second clock different than said first clock. 

W i 14. (New) The apparatus of claim 13 wherein said memory command 

% 2 packet chunk output further comprises a row output and a column output. 

1" 1 15. (New) The apparatus of claim 13 wherein said host side memory 

\ti 2 controller region further comprises a scheduler coupled to said memory access 

% 3 request input, said scheduler configured to generate said memory command 

7 4 packet. 



1 16. (New) The apparatus of claim 15 wherein said scheduler is coupled 

2 to a queue. 

1 17. (New) The apparatus of claim 15 wherein said memory command 

2 packet is a row command. 

1 18. (New) The apparatus of claim 15 wherein said memory command 

2 packet is a column command. 



2 



1 19. (New) The apparatus of claim 15 wherein said scheduler further 

2 comprises logic to determine when resource conflicts may arise. 

1 20. (New) The apparatus of claim 13 wherein said host side memory 

2 controller region further comprises a second memory command packet chunk 

3 output. 

1 21. (New) The apparatus of claim 20 wherein said second clock is faster 

tn 2 than said first clock. 

? V s ! 

ffj 1 22. (New) The apparatus of claim 20 wherein said host side memory 

*§j 2 controller region is configured to present a second memory command packet 

\ " ? 3 chunk upon said second memory command packet chunk output, said second 

ijj 4 memory command packet chunk a portion of a second memory command 

-jp 5 packet. 

1 23. (New) An apparatus, comprising: 

2 a) a memory controller comprising a host side region and a memory 

3 side region, said host region having a memory access request input and a 

4 memory command packet chunk output, a memory command packet chuck 

5 being a portion of a memory command packet, said host side region clocked by a 

6 first clock, said memory side region having a memory command packet chunk 

7 input coupled to said memory command packet chunk output, said memory side 
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8 region clocked by a second clock, said second clock different than said first clock; 

9 and 

10 b) a memory coupled to said memory side region. 



1 24. (New) The apparatus of claim 23 further comprising an external 

2 agent configured to read and write to said memory via said memory controller. 



1 25. (New) The apparatus of claim 24 wherein said external agent 

2 further comprises a processor. 

■I/a*,? 

l|i 1 26. (New) The apparatus of claim 24 wherein said external agent 

111 

2 further comprises a graphics subsystem. 

1 u 1 27. The apparatus of claim 24 wherein said external agent further 

lj 2 comprises an expansion bus master. 

Q 1 28. (New) The apparatus of claim 23 wherein said memory command 

2 packet chunk output further comprises a row output and a column output. 



1 29. (New) The apparatus of claim 23 wherein said host side memory 

2 controller region further comprises a scheduler coupled to said memory access 

3 request input, said scheduler configured to generate said memory command 

4 packet. 



1 30. (New) The apparatus of claim 29 wherein said scheduler is coupled 

2 to a queue. 
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1 31. (New) The apparatus of claim 29 wherein said memory command 

2 packet is a row command. 

1 32. (New) The apparatus of claim 29 wherein said memory command 

2 packet is a column command. 

1 33. (New) The apparatus of claim 29 wherein said scheduler further 

2 comprises logic to determine when resource conflicts may arise. 

1 34. (New) The apparatus of claim 23 wherein said host side memory 

2 controller region further comprises a second memory command packet chunk 

3 output. 

1 35. (New) The apparatus of claim 34 wherein said second clock is faster 

2 than said first clock. 

1 36. (New) The apparatus of claim 34 wherein said host side memory 

2 controller region is configured to present a second memory command packet 

3 chunk upon said second memory command packet chunk output, said second 

4 memory command packet chunk a portion of a second memory command 

5 packet. 

1 37. (New) A method, comprising: 

2 a) generating a memory command packet from a memory request 

3 while clocked by a first clock; and 
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4 b) sending a memory command packet chunk to a memory controller 

5 region clocked by a second clock, a memory command packet chuck being a 

6 portion of said memory command packet, said second clock different than said 

7 first clock. 

1 38. (New) The method of claim 37 further comprising sending a second 

2 memory command packet chunk along with said memory command packet 

3 chunk, said second memory command packet chunk associated with a 
35 4 subsequent memory command packet. 

iff 

j|i 1 39. (New) The apparatus of claim 37 wherein said memory command 

-sijas 

*S 2 packet further comprises a row command. 

% 1 40. (New) The apparatus of claim 39 wherein said row command 

I** 2 activates a memory row. 

Q 1 41. (New) The apparatus of claim 39 wherein said row command 

2 precharges a memory row. 

1 42. (New) The apparatus of claim 39 wherein said memory command 

2 packet further comprises a column command. 

1 43. (New) The apparatus of claim 39 wherein said column command is 

2 used to read from a memory device. 

1 44. (New) The apparatus of claim 39 wherein said column command is 

2 used to write to a memory device. 

6 



COMMENTS 

Claims 1 through 12 have been canceled without prejudice. New claims 13 
through 44 have been added. Applicant respectfully requests the examination of 
new claims 13 through 44 and allowance of same. 



Respectfully submitted, 
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CROSS-CLOCK DOMAIN DATA TRANSFER METHOD AND APPARATUS 

FIELD OF THE INVENTION 

The present invention relates to the field of data transfer, and more 
particularly to transferring information between circuits that are operated in 
5 different digital clock domains. 

BACKGROUND OF THE INVENTION 

In modern electronic devices, it is often necessary to transfer data 
between circuits in different digital clock domains. Although in many cases 
the clocks in the different domains are entirely unrelated, in some cases the 
IJjlO clocks are generated from a common base clock or otherwise have a 
l|| frequency relationship that may be used to time the transfer of information. 
''jjz For example, in at least one prior art memory system, a reference clock signal 
is used to generate both a memory clock 17 and a host clock 16. The memory 
I* clock 17 is used to time the transfer of data and commands to one or more 
■^15 memory devices via a high-speed, packet-oriented communication channel. 
^ The host clock 16 is used to manage memory access requests received from 
W external agents and to format the requests into appropriate command and 

data packets for consumption in the memory clock 17 domain. To transfer a 
command or data packet to the memory clock 17 domain, control logic in the 
20 host clock 16 domain asserts the packet to logic in the memory clock 17 

domain for as many cycles of the host clock 16 as required for the memory 
clock 17 domain to consume the packet. 

One disadvantage of the above-described packet transfer technique is 
that, except for certain memory clock 17 to host clock 16 frequency ratios, the 
25 control logic in the host clock 16 domain will assert the packet to the 

memory clock 17 domain beyond the time necessary for the memory clock 17 
domain to consume the packet. For example, suppose that the frequency 
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ratio between the memory clock 17 and the host clock 16 is such that the 
memory clock 17 domain can consume three-fourths of a packet during each 
cycle of the host clock 16. Because the memory clock 17 domain cannot 
consume an entire packet during a cycle of the host clock 16, the control logic 

5 in the host clock 16 domain outputs each packet for at least two cycle of the 
host clock 16, even though the memory clock 17 domain will consume the 
data in only one and one-third cycle of the host clock 16 (each quarter of the 
packet is consumed in a third of a cycle of the host clock 16). Consequently, 
each packet is asserted for two-thirds of a cycle of the host clock 16 longer 

10 than necessary, thereby reducing throughput and increasing response latency 
in the memory system. 
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SUMMARY OF THE INVENTION 

A method and apparatus for transferring units of information 
between clock domains in a memory system are disclosed. A respective set of 
N units of information is loaded from an output circuit in a first clock 
domain to a storage circuit in a second clock domain during each clock cycle 
of the first clock domain. The output circuit selects each respective set of N 
units of information to include units of information that have previously 
been loaded into the storage circuit and that will not be output from the 
storage circuit prior to the storage circuit being loaded with a subsequent set 
of N units of information, and 

a complement number of units of information that have not previously 
been loaded into the storage circuit. 

Other features and advantages of the invention will be apparent from 
the accompanying drawings and from the detailed description that follows 
below. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example and not 
limitation in the figures of the accompanying drawings in which like 
references indicate similar elements and in which: 

Fig. 1 illustrates a memory system according to one embodiment; 

Fig. 2 illustrates an exemplary column command packet; 

Fig. 3 illustrates the row command logic of Fig. 1 according to one 
embodiment; 

Fig. 4 illustrates the hold logic and the steering logic of the row push 
controller of Fig. 3 according to one embodiment; 

Fig. 5 is a timing diagram that illustrates use of a sample select signal 
to load data from a push controller into a shift logic circuit; 

Fig. 6 is a timing diagram that illustrates use of a sample select signal 
to load data from a push controller into a shift logic circuit when there is a 
fractional number of memory clock cycles per host clock cycle; 

Fig. 7 illustrates pget logic within the push controller according to one 
embodiment; 

Fig. 8 is a diagram of a lane select state machine for generating the 
bubble select and live select signals according to one embodiment; 

Fig. 9 is a table that indicates the next value of the lane pointer for a 
given output lane; 

Fig. 10 is a logic diagram that illustrates generation of a hold select 
signal for a storage element in the hold logic; 

Fig. 11 illustrates an first exemplary shifting of packet chunks through 
the output lanes of the steering logic; 

Fig. 12 illustrates a second exemplary shifting of packet chunks 
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through the output lanes of the steering logic; 

Fig. 13 illustrates a third exemplary shifting of packet chunks through 
the output lanes of the steering logic; and 

Fig. 14 illustrates a fourth exemplary shifting of packet chunks 
through the output lanes of the steering logic. 
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DETAILED DESCRIPTION 

A memory controller that includes logic for efficiently transferring 
information between a host clock 16 domain and a memory clock 17 domain 
is disclosed. A scheduler 30 responds to memory access requests from 

5 external agents by formatting command packets and inserting the packets in 
packet queue 63s. Respective row push controller 27 accept the packets from 
the packet queue 63s and output the packets to a shift circuit in the memory 
clock 17 domain. The packets are shifted out of the shift circuit in fractional 
packet units referred to herein as "packet chunks." One packet chunk is 

10 shifted out of the shift circuit in the memory clock 17 domain per cycle of the 
memory clock 17. A packet chunk is said to be consumed by the memory 
clock 17 domain when it is shifted out of the shift circuit. 

To avoid outputting packets to the memory clock 17 domain for 
longer than necessary, the row push controller 27 are designed to manage 

15 packet transfer based on packet chunks, instead of whole packets. By 

managing packet transfer based on packet chunks, it becomes possible to 
increase the packet throughput of the host clock 16 domain, because after an 
initial portion of a packet is consumed, only the unconsumed portion of the 
packet is output in the subsequent clock cycle and the remaining output 

20 paths of the row push controller 27 are used to output an initial portion of a 
subsequent packet. Because packets are transferred from the host clock 16 
domain to the memory clock 17 domain with less delay, the overall response 
latency of the memory system may be reduced. 

Fig. 1 illustrates a memory system 10 according to one embodiment. 

25 The memory system 10 includes a memory controller 20, a memory array 11, 
a reference clock generator 12 and a memory clock generator 14. The 
reference clock generator 12 outputs a reference clock 15 (REFCLK) to the 
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memory controller 20 and to both the memory clock generator 14. The 
reference clock 15 may be divided to a reduced frequency before being 
supplied to the memory clock generator 14. The memory clock generator 14 
generates a memory clock 17 (MCLK) from the reference clock 15 and outputs 
5 the memory clock 17 to the memory array 11 and to the memory controller 
20. 

The memory controller 20 includes host-side control logic 21, 
memory-side control logic 22 and gear logic 23. The host-side control logic 21 
includes clock insertion logic 19 that generates a host clock 16 (HCLK) from 
10 the reference clock 15. In one embodiment, the clock insertion logic 19 

i'i-J 

# includes a phase-locked loop circuit to advance the phase of the host clock 16 

'ill il 

ly relative to the reference clock 15 to cancel delay caused by insertion of the 
■ifi reference clock 15 into the integrated circuit (IC) that contains the memory 
fy controller 20. The host clock 16 is used to clock logic units within the host- 
ing 15 side control logic 21, and the memory clock 17 is used to clock logic units 
2 within the memory-side control logic 22 and also to clock transfers of 
% command and data packets to the memory array 11. 
¥ The host side control logic 21 includes a scheduler 30 that receives 

requests to read and write the memory array 11 from external agents (e.g., 
20 processor, graphics subsystem, expansion bus master, and so forth). In one 
embodiment, the memory array 11 is a dynamic random access memory 
(DRAM) array in which packetized row commands are used to activate and 
precharge rows of memory cells and in which packetized column commands 
are used to read and write data in active rows. Consequently, the scheduler 
25 30 responds to the memory access requests from the external agents by 
generating an appropriate sequence of packetized row and column 
commands to carry out the requested access operation. For example, if the 
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scheduler 30 receives a request to access an inactive row in the memory array 
11, the scheduler 30 will output a packetized row activate command to 
activate the row followed by a packetized column access command to access 
the row. The scheduler 30 may also output a packetized row precharge 
5 command to precharge the row after the column access is completed. 

In some cases, the scheduler 30 may receive requests to access different 
rows within the same bank of the memory array 11. If the requests arrive 
near enough in time, they may conflict with one another, producing a 
resource conflict in the memory system 10. Other types of resource conflicts 
10 may also arise in memory system 10. When a resource conflict arises, it may 
^ be necessary to delay a memory access operation until a prior operation is 
'A\ completed or until the needed resource becomes available. In one 
% embodiment, the scheduler 30 includes logic to determine, based on 
'if* incoming requests, when resource conflicts may arise, and to take action to 
% 15 hold off memory operations by inserting a delay between command packets 
W as necessary. This delay is referred to herein as a bubble. According to one 
=f embodiment, the scheduler 30 includes a value called a bubble vector with 
Cl each command packet to indicate whether to delay the packet and, if delay is 
required, how long to delay the packet. The bubble vector is a value that 
20 indicates how many null packet chunks are to be inserted in the outgoing 

stream of chunks ahead of the packet (i.e., how many null chunks to prepend 
to the packet). In an alternate embodiment, the bubble vector could indicate 
how many null packet chunks should be appended to the rear of the packet. 
The bubble vector could also indicate both the number of null chunks to be 
25 prepended to the packet and the number of chunks to be appended to the 
packet. Bubble insertion is discussed in further detail below. 

In addition to generating command packets and bubble vectors, in at 
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least one embodiment, the scheduler 30 may also occasionally output a 
packet on a live bypass path 68 (not shown in Fig. 1). The scheduler 
indicates the presence of a packet on the live bypass path 68 by asserting a 
bypass signal 55 54. Live bypass operations are discussed below in further 
5 detail. 

The host-side control logic 21 includes row queue logic 25, row push 
controller 27, column queue logic 26 and column push controller 28. The 
scheduler 30 outputs row request command packets and associated bubble 
vectors to the row queue logic 25 which enqueues the row command packets 
10 and bubble vectors and outputs one row command packet and bubble vector 
*i at a time to the row push controller 27. The row queue logic 25 controller 27 
|t| outputs a packet-available signal 97 (PAVAIL) to the row push controller 27 
^jh to indicate the availability of valid packet, and the row push controller 27 
ill outputs a packet-get signal 98 (PGET) signal to the row queue logic 25 to 

■3'- 

i*% 15 request a new packet from the queue. The row push controller 27 outputs 
ajf the row command packets to row shift logic 35 in the memory-side control 

if?" 

jj: logic 22 which, in turn, shifts the command packets out to a high speed 
S request channel 43 (RQ) for delivery to the memory array 11. The row queue 
logic 25, row push controller 27 and row shift logic 35 are referred to 
20 collectively herein as the row command logic. 

The column queue logic 26, column push controller 28 and column 
shift logic 36 collectively form column command logic that operates 
similarly to the row command logic, except that the column command logic 
is used to deliver column command packets to the memory array 11. Also, 
25 the transmit data logic 31 and transmit data shift logic 33 collectively form 
data store logic that may be operated similarly to the row and column 
command logic, except that the data store logic is used to deliver write data 
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packets to the memory array 11 via a high speed data channel 45 (DQ). The 
receive data logic 32 and receive data shift logic 34 collectively form data 
retrieve logic that operates in an inverse fashion to the data store logic to 
retrieve data packets from the memory array 11 via the DQ channel 45 and to 
5 format the data packets for transfer to the requesting units via a data bus 
(DATA). 

The memory controller also includes gear logic 23 which generates a 
number of signals based on the frequency ratio of the host clock 16 to the 
memory clock 17. The signals generated by the gear logic 23 include a pair of 
10 clock feedback signals (HCFB, MCFB), a next memory clock count 52 
# (NXT_MCLK_CNT) and a sample select signal 53 (SAMP_SEL). In one 
I'y embodiment, the ratio of the host clock 16 frequency to the memory clock 17 
=| frequency (referred to as a "gear ratio") is indicated by hardwired signals set 
ill within or supplied to the memory controller 20. The gear logic 23 receives 
J f h 15 the host clock 16 and a subdivided version of the memory clock 17, referred 
|2 to herein as sample clock 18 (SCLK), and subdivides the host clock 16 and 
J: sample clock 18 to a common frequency to generate the pair of clock feedback 
m signals HCFB and MCFB. No subdivision is necessary if the host clock 16 and 
sample clock 18 are already at the same frequency. The clock feedback signals 
20 are output to the memory clock generator 14 which adjusts the phase of the 
memory clock 17 according to the phase difference between the clock 
feedback signals to control the phase relationship between the host clock 16 
and the memory clock 17. 

The gear logic 23 updates the next memory clock count 52 during each 
25 cycle of the host clock 16 based on the frequency ratio between the host clock 
16 and the memory clock 17. As discussed below, the next memory clock 
count 52 is a count value that indicates the number of packet chunks that 
Atty. Docket No. 42390.P6357 -10- 



will be shifted out of the shift logic circuits 35, 36 in the memory-side control 
logic 22 during the next cycle of the host clock 16. In one embodiment, the 
next memory clock count 52 is supplied to the row push controller 27 and the 
column push controller 28. In an alternate embodiment, the next memory 
5 clock count 52 may additionally be provided to the transmit data logic 31 and 
the receive data logic 32. 

The sample select signal 53 is output from the gear logic 23 to clock 
management logic 29 in the memory-side control logic 22. The sample select 
signal 53 is sampled by the memory-side control logic 22 during each cycle of 
10 the sample clock 18 and is used to determine which cycles of the memory 
# clock 17 in the subsequent sample clock 18 cycle are to be used to load data 
I'll from the row push controller 27 to the shift logic circuits 35, 36. The clock 
*i management logic 29 receives the memory clock 17 from a buffer amplifier 
ill 41 and subdivides the memory clock 17 by four to generate the sample clock 
13 15 18. Other subdividing values may be used in alternate embodiments. The 

clock management logic 29 also outputs a transmit clock 42 (TCLK) and a 
2 receive clock (RCLK). These clocks are operated at the memory clock 
^ frequency, but have a slightly different phase than the memory clock 17. 

Fig. 2 illustrates an exemplary column command packet 50. In one 
20 embodiment, the width of the column command packet is fixed at a 
predetermined number of bits according to the number of conductors 
allotted in the channel 43 to transmit the packet to the memory array 11. The 
length of the packet is also fixed at a predetermined number of bits to allow 
the packet to be output to the memory array 11 in a predetermined number 
25 of cycles of the memory clock 17. In the exemplary column command packet 
50, for example, five bits are transferred during each falling edge and each 
rising edge of the memory clock 17. By constraining column command 
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packets to have a fixed length of eight bits, any column command packet can 
be shifted from the column shift logic 36 to the appropriate conductors of 
channel 43 in four cycles of the memory clock. 

In one embodiment, row command packets and data packets each 
have the same packet length as the column command packet, but different 
packet widths. In discussions which follow, packet lengths are assumed to be 
fixed at eight bits so that a packet is consumed by the memory-side control 
logic 22 in four memory clock cycles. However, other packet dimensions 
may be used in alternate embodiments, including time-varying packet 
dimensions, without departing from the scope of the present invention. 

Each pair of bit columns in packet 50 constitute a unit of information 
referred to herein as a packet chunk. Generally, a packet chunk is the amount 
of information that is consumed by the memory-side control logic 22 in a 
single cycle of the memory clock 17. As mentioned above, the memory-side 
control logic 22 consumes a packet chunk each time it shifts a chunk out of a 
shift logic circuit 35, 36. The number of bits in a packet chunk depends upon 
the width of the packet and the number of shift operations per memory clock 
17 cycle. In the packet of Fig. 2, the packet is five bits wide and separate sets of 
five bits are shifted out to the channel 43 during each falling edge and each 
rising edge of the memory clock 17. Therefore, the column command packet 
50 includes four chunks of ten bits each. As discussed above, row command 
packets and data packets may have different widths, and alternate packet 
dimensions (including time varying dimensions) may be used in other 
embodiments. 

Fig. 3 illustrates the row command logic of Fig. 1 according to one 
embodiment. The column command logic is implemented in an analogous 
manner and will not be separately described. A similar structure may also be 
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used to transfer data between the host clock and memory clock domains. 

The row queue logic 25 includes a packet queue 63 and a queue 
controller 61. The packet queue 63 includes entries 1 to N, each capable of 
holding a complete packet and an associated bubble vector. The arrangement 
5 of packet chunks and the bubble vector in an entry of the packet queue is 
illustrated by Entry 1. In one embodiment, each entry in the packet queue 63 
includes an enable input (EN) that is strobed to load a packet. Multiplexers 
64A-64C are used to select either the preceding entry in the packet queue 63 
or the input packet path 70 to be the source of the loaded packet. The final 
10 entry in the packet queue, i.e., EntryN, is coupled directly to the input packet 

fii When a packet to be queued in the row queue logic 25 is present on 

J the input packet path 70, the scheduler 30 (see Fig. 1) outputs a load signal 72 
Jjl'J to the queue controller 61. The queue controller 61 responds by asserting an 
%15 appropriate source select signal 67 and load packet signal 68 to cause the 
ff packet to be loaded from the input packet path 70 into the lowest numbered 
=§; available entry in the packet queue 63, called the tail of the packet queue 63. 
13 If there are no available packet queue 63 entries (i.e., the packet queue 63 is 

full), the queue controller 61 issues a back down signal 74 to the scheduler 30 
20 to cause the scheduler 30 to retry the load operation in a subsequent host 

clock cycle. 

The lowest numbered entry in the packet queue 63 is referred to as the 
head of the queue. A packet and bubble vector at the head of the packet 
queue 63 are output to the row push controller 27 and are referred to herein 
25 as being "live". The queue controller 61 outputs a packet-available signal 97 
(PA VAIL) to indicate presence of a valid packet and bubble vector at the head 
of the packet queue 63. When the row push controller 27 is ready for a new 
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live packet and bubble vector, the row push controller 27 asserts a packet-get 
signal 98 (PGET) to the queue controller 61 to cause the queue controller 61 to 
advance the entries in the packet queue 63 so that the next packet in sequence 
is output at the head of the packet queue 63. The queue controller 61 
5 responds to the packet-get signal 98 by asserting respective source select 

signals and load packet signals to cause the contents of the packet queue 63 to 
be shifted by one entry toward the head of the packet queue 63. The value 
previously at the head of the packet queue 63 is overwritten and is said to be 
popped of the packet queue 63. 
10 In one embodiment, the queue controller 61 is implemented by a state 

.J machine that is clocked by the host clock 16 and that maintains a tail pointer 
i|p to track the tail of the packet queue 63. The tail pointer is decremented each 
J time a packet is popped off the head of the packet queue 63 in response to a 
If. packet-get signal 98, and the tail pointer is incremented each time a packet is 
M5 loaded into the packet queue 63 in response to a load signal 72. If the tail 
iy pointer points to the head of the packet queue 63, then the packet queue 63 is 
empty and the packet-available signal 97 is deasserted. If the tail pointer does 
Q not point to the head of the packet queue 63, then the packet-available signal 
97 is asserted to indicate that a live packet and bubble are present at the head 
20 of the packet queue 63. 

In one embodiment, the row push controller 27 includes hold logic 85, 
steering logic 83 and a push state machine 81. The hold logic 85 is used to 
store chunks of partially consumed packets and thereby allows a new packet 
to be advanced to the head of the packet queue 63, while preserving the yet 
25 unconsumed chunks of the previous packet. In one embodiment, the hold 
logic 85 receives the output of the packet queue 63 via chunk lines 72B, 72C 
and 72D. The hold logic 85 is also coupled to at least a portion of the live 
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bypass path 68 and, in certain circumstances, is used to store one or more 
chunks delivered via the live bypass path 68. A hold select signal 91 from 
the push state machine 81 is used to determine the source of chunks stored 
in the hold logic 85 during a each cycle of the host clock 16. 
5 The steering logic 83 receives each of the constituent chunks of a live 

packet directly from the head of the packet queue 63 via paths 72A-72D 
(indicated by the dashed-line paths crossing the hold logic 85 in Fig. 3), as 
well as any packet chunks stored in storage elements within the hold logic 
85. The steering logic 83 is also coupled to receive packet chunks from the 
10 live bypass path 68 when a live bypass operation occurs. Based on select 
41 signals from the push state machine 81, including a live select signal 92, 
1 y chunk select signal 93, bypass select signal 94 and bubble select signal 95, the 

steering logic 83 selects four packet chunks to be output to the row shift logic 
ill 35. The selected packet chunks may include null chunks. 
JUl5 The push state machine 81 receives the packet-available signal 97 from 

l*¥ s the queue controller 61, the next memory clock count 52 from the gear logic 
J; 23 (shown in Fig. 1) and the bypass signal 55 from the scheduler 30 (also 
Q shown in Fig. 1). In response to these input signals, the host clock 16, and the 
current state of the push state machine 81, the push state machine 81 
20 generates the above-mentioned select signals and the packet-get signal 98. 
The logic used to generate the select signals and the packet get signal 98 is 
discussed below. 

Fig. 4 illustrates the hold logic 85 and the steering logic 83 of the row 
push controller 27 of Fig. 3 in greater detail. In one embodiment, the hold 
25 logic 85 includes three storage elements H1-H3 and a pair of multiplexers 101, 
103. The purpose of the storage elements H1-H3 is to hold chunks of a 
partially consumed packet so that a new packet may be advanced to the head 
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of the packet queue 63 without losing the unconsumed chunks of the current 
live packet. This arrangement allows chunks from two different packets to 
be output to the memory-side control logic during a single host-clock cycle. 
As a matter of terminology, a partially consumed packet is a packet 
5 from which at least one chunk (e.g., chunkO) has been loaded into and 

shifted out of a shift logic circuit in the memory-side control logic 22 (i.e., at 
least one chunk has been consumed). According to one embodiment, the 
push state machine 81 does not assert the packet-get signal 98 to advance a 
new packet to the head of the packet queue 63 (i.e., get a new live packet) 
10 until at least one chunk of the current live packet is consumed by the 

memory-side control logic 22. Because packet chunks are consumed in order 
starting with ChunkO, it follows that a packet will not be popped off the head 
of the packet queue until at least ChunkO is consumed. Consequently, it is 
unnecessary to provide a storage element to store ChunkO, because if 
U 15 ChunkO of a packet has not been consumed, the packet will remain live and 
¥ ChunkO will be available at the head of the packet queue 63. 
| ; The multiplexers 101, 103 are provided because, at least for storage 

3 elements H2 and H3, there is more than one possible source of the packet 

chunk to be stored. For example, in the case of storage element H2, Chunk2 
20 may be provided on the live bypass path 68 or from the packet queue 63. 

Thus, a two-input multiplexer 103 is provided to select Chunk2 from one of 
these two sources during each cycle of the host clock 16. In the case of storage 
element H3, Chunk3 may be obtained from the live bypass path 68, from the 
packet queue 63, or from the storage element H3 itself. The three-input 
25 multiplexer 101 is provided to select between these different chunk sources. 
Both multiplexers 101, 103 are controlled by a hold select signal 91 output 
from the push state machine 81. The logic used to generate the hold select 
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signal 91 is discussed below. 

When a chunk is provided to the storage element H3 from the H3 
storage element itself, the chunk is said to be recirculated in the H3 storage 
element. Recirculation of a chunk in a storage element may be required 
5 whenever more than one cycle of the host clock 16 is needed to complete 
consumption of a partially consumed packet. That is, recirculation occurs 
when consumption of a packet spans three or more cycles of the host clock 16 
(e.g., chunkO consumed in a first clock cycle, chunkl and chunk2 consumed 
in a second clock cycle, and chunk3 consumed in a third clock cycle). In one 
jte 10 embodiment, the permitted range of host clock 16 to memory clock 17 
^| frequency ratios is restricted to ratios in which at least two packet chunks are 
f 0 consumed in the memory clock domain per cycle of the host clock 16. 
# Consequently, once a packet is minimally consumed (i.e., ChunkO 
ly consumed), at least chunkl and chunk2 will be consumed during the 
■Q 15 subsequent cycle of the host clock 16. Therefore only Chunk3 will need to be 
|J recirculated, and then only for certain host clock 16 to memory clock 17 

frequency ratios. In alternate embodiments in which a wider range of host 
^ clock 16 to memory clock 17 frequency ratios are permitted or in which larger 
packet sizes are used, recirculation of other packet chunks may be necessary 
20 and also it may be necessary to recirculate a given chunk multiple times. 

The output paths from the steering logic 83 to the memory-side 
control logic are referred to herein as output lanes. In one embodiment, the 
steering includes four output lanes, Lane0-Lane3, to allow a complement of 
packet chunks to be output during each cycle of the host clock 16. Because 
25 chunks supplied to the memory-side control logic via LaneO will be 

consumed before chunks supplied via Lanel (and so forth to Lane3), the 
output lanes are effectively prioritized, with LaneO having the highest 
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priority and Lane3 having the lowest priority. 

The steering logic 83 also includes three stages of multiplexers and a 
bubble insertion stage that effectively form lane pointers to select a respective 
chunk source for each output lane according to the select signals from the 
5 push state machine. During each cycle of the host clock 16, the multiplexer 
stages select one each of ChunkO, Chunkl, Chunk2 and Chunk3 from either 
one or two packets and steer the chunks to appropriate output lanes. If 
chunks from two different packets are to be output, then one or more chunks 
will be obtained from a live packet and the remaining chunks will be 
10 obtained from the hold logic. Chunks from the hold logic are from a 
0 partially consumed packet and therefore need to be consumed by the 
fij memory-side control logic before the chunks from the live packet. 
J Therefore, the one or more chunks from the hold logic are steered to higher 
iH priority output lanes than the chunks from the live packet. If chunks from a 
% 15 single packet are to be output, then the chunks will be obtained from either a 
W ii ve packet or the live bypass path. In either case, all four chunks are steered 
;|| straight through to the output lanes so that ChunkO is output on LaneO, 
C3 Chunkl is output on Lanel and so forth. 

The first stage of multiplexers 105 is used to select either the packet 
20 queue 63 (i.e., live data) or the hold logic 85 to be the chunk source for packet 
chunks Chunkl, Chunk2 and Chunk3. The live select signal 92 from the 
push state machine 81 determines this selection. In one embodiment, the 
live select signal 92 is a vector that includes three different values that 
provide independent control of each of the first stage multiplexers 105. 
25 The second stage of multiplexers 107 is used to rotate the four packet 

chunks in cases where at least one packet chunk is obtained from the hold 
logic 85. As discussed above, chunks from the hold logic are rotated to the 
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higher priority output lanes so that they are consumed before the chunks of a 
live packet. 

The third stage of multiplexers 109 is controlled by a bypass select 
signal 94 to select either the second stage multiplexers 107 or the live bypass 
path 68 as the chunk source of the output lanes. In one embodiment, live 
bypass occurs after the packet queue 63 has been emptied and all the packets 
have been consumed by the memory-side control logic 22. To live bypass 
path 68 is provided to deliver a packet directly to the third stage multiplexers 
109 and thereby avoid the delay that results from propagation of the packet 
through the packet queue and through the first two multiplexer stages of the 
steering logic 83. In this way, the row push controller 27 is rapidly primed 
with a new packet for output to the memory-side control logic 22. 
Subsequent packets may then be enqueued in the packet queue 63 and 
propagated through the steering logic 83 while the live bypass packet is being 
consumed by the memory-side control logic. 

The bubble insertion stage 111 is used to insert null chunks in the 
sequence of chunks consumed by the memory-side control logic 22. In one 
embodiment, zero to three null chunks are inserted between packets 
according to the operating constraints of the memory array and the memory- 
side control logic 22. Because null chunks are consumed by the memory-side 
control logic like packet chunks, each null chunk is used to inject a delay 
equal to the period of the memory clock 17. Thus, null chunks serve as 
placeholders to enforce time delays between successive packet transfers. Null 
chunks are also output to the memory-side control logic 22 when no packets 
are forthcoming from the scheduler 30 and the packet queue 63 has been 
emptied. As a matter of terminology, a chunk from a command packet, a 
chunk from a data packet and a null chunk are all considered to be units of 
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information that are supplied to the memory-side control logic 22. 

In one embodiment, the bubble insertion stage 111 includes four AND 
logic elements 112A-112D, each of which is independently controlled by a 
bubble select signal 95 to null the chunk supplied to the corresponding 
5 output lane. When a line of the bubble select signal 95 is held high, for 
example at AND logic element 112 A, then the output of the corresponding 
stage three multiplexer is passed through the AND logic element 112 A to the 
output lane. When the same line of the bubble select signal 95 is driven low 
at the AND logic element 112 A, the chunk output at LaneO is forced low (i.e., 
10 nulled). The other output lanes can similarly be nulled. Other structures for 

s§ outputting null data on the output lanes may be used in alternate 

fo embodiments. Also, in an alternate embodiment, the bubble insertion stage 
111 may be positioned elsewhere in the sequence of stages, for example, 

i|j between the second stage multiplexers 107 and the third stage multiplexers 

:Ll5 109. 

]*¥ Reflecting briefly on the operation of the push controller 27 note that, 

+ even though a fixed number of packet chunks are loaded into the shift logic 
l!3 of the memory-side control logic during each host clock cycle, not all of the 
packet chunks will be shifted out of the shift circuit (i.e., consumed) before 
20 the load operation takes place for the next host clock cycle (at least for certain 
host clock to memory clock frequency ratios). In effect, the push controller 27 
tracks the state of the packet chunks in the shift circuit and steers copies of 
the unconsumed chunks to the appropriate output lanes so that, when the 
subsequent load operation takes place, any unconsumed chunks in the shift 
25 logic are overwritten with duplicate values. Consequently, new chunks are 
loaded into vacant entries of the shift circuit, while unconsumed chunks in 
the shift circuit are unaffected. In this way, a full complement of packet 
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chunks is output from the push controller and loaded into the shift circuit 
during each cycle of the host clock, but only the vacant entries of the shift 
logic are updated with new chunks. 

Having described the hold logic 85 and the steering logic 83, in bears 
5 emphasis that many of the signal paths in the hold logic 85 and the steering 
logic 83 are multiple conductor paths, with the total number of conductors 
being determined by the number of bits per chunk. For example, a packet 
having ten bits per chunk would require ten conductors per chunk path 
through the hold logic 85 and the steering logic 83. The storage elements Hl- 
10 H3 in the hold logic 85 would each be required to store a bank of ten bits, and 
m the AND logic elements 112A-112D in the bubble insertion stage of the 
f 0 steering logic 83 would each need to be able to null ten bits of data. Signal 

paths having more or fewer conductors may be used for packets having more 
||| . or fewer bits per chunk. 

f% 15 Fig. 5 illustrates the manner in which the sample select signal 53 

(SAMP_SEL) is used to load data from a push controller into a corresponding 
2 shift logic circuit in the memory-side control logic. In the example shown, 
O the ratio of the host clock 16 (HCLK) to the sample clock 18 (SCLK) is 4:3. In 
one embodiment, the sample clock 18 is generated by dividing the memory 
20 clock 17 (MCLK) by four so that there are four cycles of the memory clock 17 
for each cycle of the sample clock 18 and three cycles of the memory clock 17 
for each cycle of the host clock 16. Because of the 4:3 ratio of the host clock 16 
to the sample clock 18, rising edges of the host clock 16 and sample clock 18 
are aligned on a beat frequency that is one-fourth the host clock 16 frequency 
25 and one-third the sample clock 18 frequency. 

In one embodiment, the sample select signal 53 is sampled by the clock 
management logic (element 29 of Fig. 1) in the memory clock 17 domain in 
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response to each falling edge of the sample clock 18. These sample points are 
indicated in Fig. 5 by the arrows above the falling edges of the sample clock 
18. The sample select signal 53 is a vector in which each bit corresponds to a 
respective memory cycle in the subsequent cycle of the sample clock 18 and 
indicates whether or not to load the output of the push controller into the 
corresponding shift logic circuit during that memory cycle. 

The push controller output begins to transition in response to each 
rising edge of the host clock 16. Because a relatively constant time passes 
before the push controller output begins to change, and because it takes a 
relatively constant time Ts for the push controller output to settle, the push 
controller output remains valid for a time before and after each subsequent 
rising edge of the host clock 16. This is illustrated in Fig. 5 by the Dat_0- 
Dat_5 data valid windows. Thus, by causing the memory-side control logic 
to sample the push controller output in response to a rising edge of the 
memory clock 17 that coincides with the rising edge of the host clock 16, a 
valid set of chunks from the push controller will be loaded into the 
corresponding shift logic circuit of the memory-side control logic during each 
cycle of the host clock 16. Therefore, the sample select signal 53 output to the 
memory-side control logic during each cycle of the sample clock 18 is 
assigned a value that indicates which of the four memory cycles in the 
subsequent sample clock period will coincide or most nearly coincide with a 
rising edge of the host clock 16 signal. For example, the second of four 
memory cycles is aligned with a rising edge of the host clock 16 in the second 
cycle of the sample clock 18. Consequently, the sample select signal 53 output 
during the first sample clock 18 signal has a value of 0010 (in this notation, 
the rightmost digit corresponds to the first memory clock 17 cycle of the 
sample clock 18 period). During the third cycle of the sample clock 18, the 
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first and fourth cycles of the memory clock 17 are aligned with rising edges of 
the host clock 16. Consequently, the sample select signal 53 output during 
the second sample clock 18 signal has a value of 1001, indicating that the shift 
logic circuit should be loaded twice during the third cycle of the sample clock 

5 18. During the fourth cycle of the sample clock 18, the third memory clock 17 
cycle is aligned with a rising edge of the host clock 16. Consequently, the 
sample select signal 53 output during the third sample clock 18 signal has a 
value of 0100. Because the phase relationship between the host clock 16 and 
the sample clock 18 repeats every three cycles of the sample clock 18, the 

10 sample select signal 53 will be output in a repeating pattern of 0010, 1001, 
0100. 

Fig. 6 is a timing diagram that illustrates the values of the sample 
select signal 53 for a 3:2 frequency ratio between the host clock 16 and the 
sample clock 18. In this timing arrangement, number of cycles of the 

15 memory clock 17 per cycle of the host clock 16 is a fractional value, so that, 
except for at the beat period (every third cycle of the host clock 16), the 
memory clock 17 does not exactly coincide with a rising edge of the host clock 
16. In one embodiment, this circumstance is managed by extending the time 
during which the push controller output is valid in order to allow for 

20 sampling on cycles of the memory clock 17 that somewhat precede the rising 
edge of the host clock 16. The valid push controller output is extended, in 
one implementation, by shortening the time required for the push controller 
output to begin transitioning, Tbtv or by shortening the time required for the 
push controller output to settle, Ts, or by shortening both Tbt and Ts- As 

25 shown in Fig. 6, the worst case offset between a sampling edge of the memory 
clock 17 and a rising edge of the host clock 16 occurs during the third cycle of 
the sample clock 18. At that point the sampling edge of the memory clock 17 
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leads the host clock by two-thirds of a memory clock cycle. As illustrated in 
Fig. 6, this worst-case sampling point and the setup time Tsu of the shift logic 
in the memory-side control logic (i.e., the time for which the push controller 
output must be valid before it can be loaded into the shift logic) determine 

5 the maximum of the sum of Tbt and Ts- 

Another interesting consequence of the fractional ratio between the 
frequencies of the memory clock 17 and the host clock 16 is that a varying 
number of cycles of the memory clock 17 occur between sample points. 
Consequently, the number of packet chunks consumed by the memory-side 

10 control logic per cycle of the host clock 16 varies in a repeating pattern of 2, 3, 
3, 2, 3, 3. This is the pattern of the next memory clock count 52 signal output 
by the gear logic 23. By contrast, the next memory clock count 52 output in 
the 4:3 clock ratio of Fig. 5 remains constant at three. 

Still referring to Fig. 6, the sample select output during the first cycle of 

15 the sample clock 18 is 1001, indicating that the push controller output is to be 
sampled twice during the second cycle of the sample clock 18 (at memory 
clock cycles one and four). The sample select output during the second cycle 
of the sample clock 18 is 0010, indicating that the push controller output is to 
be sampled once during the third cycle of the sample clock 18 (at memory 

20 clock cycle two). Because the phase relationship between the host clock 16 
and the sample clock 18 repeats every two cycles of the sample clock 18, 
sample select signal 53 will be output in a repeating pattern of 1001, 0010. 

Fig. 7 illustrates pget logic 99 within the push controller according to 
one embodiment. During each cycle of the host clock, the pget logic 99 

25 determines the number of chunks that will be consumed by the memory- 
side control logic and the number of chunks pending in the push controller 
and determines whether the push controller will need a new live packet 
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during the subsequent cycle of the host clock. If so, the packet-get signal 98 is 
asserted, otherwise the packet-get signal 98 is deasserted. 

The pget logic includes two state elements that are each updated once 
per cycle of the host clock: a chunk pending register 153 and a previous pget 
5 register 151. The chunk pending register 153 indicates the number of chunks 
of packet data and bubbles that the push controller is processing in the 
current host clock. The previous pget register asserts a signal 107 indicating 
the state of the packet-get signal 98 during the previous cycle. If the previous 
pget register 151 indicates that the packet-get signal was asserted during the 
10 previous cycle and if the packet-available signal 97 is currently asserted, then 
the value of the live bubble vector 73 (i.e., from the packet queue) is 
f*i combined with a chunks pending signal 106 from the chunk pending register 
J 153 using adder 152. The next memory clock count 52 is then subtracted from 
ifj the output of the adder 152 in subtracter 154. If the subtraction result is 
% 15 negative, a borrow signal 108 is asserted, indicating that there are not enough 
^ chunks pending (including null chunks indicated by the bubble vector) to 
*jp supply the number of chunks that will be consumed in the next cycle of the 
Q host clock 16. In that event, the packet-get signal 98 is asserted. In the 

embodiment depicted in Fig. 7, the borrow signal 108 is the packet-get signal. 
20 In alternate embodiments, additonal logic may be interposed between the 
borrow signal 108 and the packet-get signal 98 (e.g., logic to force packet-get 
high in certain instances). When the borrow signal is asserted, path B in 
multiplexer 158 is selected to provide the next chunks pending value. If the 
packet-available signal 97 is asserted, then the next chunks pending value 
25 will be the output of the subtractor 154 plus the number of chunks per packet 
(e.g., four — contributed by adder 156), supplied via path B of multiplexer 160. 
If the packet-available signal is not asserted, then the chunks pending count 

Atty. Docket No. 42390.P6357 -25- 



is forced to zero through path A of the multiplexer 160. In one embodiment, 
the chunks pending signal is used by gate the bypass select signal (e.g., signal 
94 in Fig. 4) so that live bypass is only permitted if there are no chunks 
pending in the push controller. 

5 Fig. 8 is a diagram of a lane select state machine 200 for generating the 

bubble select and live select signals according to one embodiment. The lane 
select state machine 200 is included within the push state machine and 
transitions from between states in response to transitions of the host clock. 
In one embodiment, a separate lane select state machine 200 is provided for 

10 each output lane of the push controller. 

The lane select state machine can enter one of three states, Packet 201, 
Bubble 203 and Hold 205 , and is initially reset to the Hold state 205. From a 
given state, Packet, Bubble or Hold, the lane select state machine can enter 
either of the other two states or remain in the same state. Generally, the 

15 bubble select signal and the live select signal for a given output lane are 

asserted according to the lane select state for that lane. If the lane select state 
is the Bubble state 203, the bubble select signal for the lane is asserted. If the 
lane select state is the Hold state 205, the live select signal for the lane is 
deasserted to select an output from the hold logic 85. If the lane select state is 

20 in the Packet state 201, the live select signal for the output lane is asserted to 
select an output from the live packet at the head of the packet queue, with 
one exception; if, while in the Packet state, the packet-available signal is true, 
the packet-get signal was asserted during the previous cycle of the host clock, 
the bubble width is nonzero, and the lane pointer is less than the bubble 

25 width, then the bubble select signal is asserted. The logical conditions for 
transitioning between the three states are shown in Fig. 8. The lane select 
state machine remains in the same state if neither of the conditions for 
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transitioning to the other two states is satsified. As discussed above, the new 
count (NEW_CNT) refers to the sum of the value of the lane pointer (0-3) 
for the output lane plus the next memory clock count. In the notation of Fig. 
8, the expression "bubble_width /=0 n refers to a nonzero bubble vector; the 
term MOD refers to the modulus operator (i.e., remainder of integer 
division); the expression "pavail = T" refers to an asserted packet-available 
signal; the character indicates a logical AND operation; the V character 
indicates a logical OR operation; and the T character indicates a logical NOT 
operation. 

As alluded to above, a lane pointer is used to indicate whether a given 
output lane is to receive ChunkO, Chunkl, Chunk2 or Chunk3 (bubble 
insertion overrides the delivery of a chunk to an output lane). Fig. 9 is a 
table that indicates the next value of the lane pointer (NXTJTR) for a given 
output lane. In one embodiment, the next lane pointer value is a function of 
the state of the corresponding lane select state machine and the indicated 
conditions. For example, as shown by row two of the table, if the lane select 
state machine for the output lane is in the Packet state, the next value of the 
lane pointer is equal to the new count (NEWJZNT) mod 4 (i.e., remainder of 
integer division of new count by four), if the new count is greater than three 
and the packet-available signal is asserted. As discussed above, the new 
count is the sum of the current value of the lane pointer and the next 
memory clock count. As another example, as shown by row eight of the 
table, if the lane state machine is in the Hold state, and the new count is 
greater than three, but the packet-available signal is deasserted (pavail == , 0 , ) / 
then the lane pointer will be assigned the lane id (i.e., zero for LaneO, 1 for 
Lanel, 2 for Lane2, and 3 for Lane3). According to one embodiment, the 
various conditions are combined with the state of the lane select state 
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machine for each output lane to generate the next lane pointer in accordance 
with the table in Fig. 9. In some cases, conditions may be combined for more 
efficient implementation. 

Jig. 10 is a logic diagram that illustrates generation of the hold select 
5 signal for the storage element H3 of hold logic 85. At decision block 171, the 
setting of the lane pointer for output LaneO is evaluated to determine if it 
points to storage element HI. If so, and if the next memory clock count is 
less than three, then the present content of storage element H3 will be 
needed during the next cycle of the host clock. Consequently, at block 173, 
10 the select pointer for storage element H3 (i.e., multiplexer 101 of Fig. 4) is 
m configured to recirculate the content of the H3 storage element. If the lane 
j|j pointer for output LaneO is not pointed at storage element HI, or if the next 
^| memory clock count is not less than three, then the bypass signal is 
Jjfs evaluated at decision block 175 to determine if a live bypass operation is to be 
% h l5 performed. If the bypass signal is asserted (and bypass has not been not 

disabled), then the select pointer for storage element H3 is configured at block 
177 to store chunk3 from the live bypass path. If the bypass signal is not 
asserted (or bypass has been disabled), then the select pointer for storage 
element H3 is configured at block 179 to store a live chunk3 from the packet 
20 queue. 

Fig. 11 illustrates an exemplary shifting of packet chunks through the 
output lanes of the steering logic when the frequency ratio of the memory 
clock 17 to the host clock 16 is 3:1. At the start of Cyclel of the host clock 16, 
the entry at the head of the packet queue contains packet A, including 
25 chunks AO- A3. The entry at the head of the packet queue also contains a 

bubble vector indicating that zero null chunks are to be prepended to packet 
A before it is output to the memory-side control logic 22. Sometime later in 
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Cyclel of the host clock 16, chunks AO- A3 of packet A appear at output lanes 
Lane0-Lane3, respectively, of the steering logic. The subscript "L" on each of 
the chunks of packet A at the steering logic output indicates that the chunks 
are obtained from a live packet (i.e., a packet at the head of the packet queue). 
5 Because the packet queue contains at least packet A, the packet-available 
signal (PA VAIL) is asserted. Also, because the next memory clock count 
(abbreviated as NC in Fig. 11) is three, the push state machine determines 
that at least one chunk of the current live packet (i.e., packet A) will be 
consumed by the shift logic after the current set of chunks (setl) is loaded 

10 into the shift logic and before the next set of chunks (set2) is loaded into the 
shift logic (i.e., at least one chunk of packet A will be "consumed before 
subsequent load")- Consequently, the push state machine asserts the packet- 
get signal (PGET) to advance the next packet to the head of the packet queue. 
In response to the packet-get signal, packet B becomes the new live 

15 packet during Cycle2 of the host clock 16 and the packet-available signal 

remains asserted. Because chunks A0-A2 of packet A have been consumed, 
but not A3, A3 is repeated in the steering logic output during Cycle2 of the 
host clock. Because packet A is no longer live, A3 is selected from a storage 
element in the hold logic (hence the subscript H) and rotated to LaneO, the 

20 highest priority output lane. Because the bubble vector associated with live 
packet B indicates that no null chunks are to be prepended to packet B, the 
first three chunks of live packet B (i.e., BO, Bl and B2) are also presented at 
the steering logic output in Cycle2, but at the lower priority output lanes, 
Lanel-Lane3. Because the next memory clock count is three, the push state 

25 machine determines that chunks BO and Bl of packet B (along with A3 of 

packet A) will be consumed before subsequent load. Consequently, the push 
state machine asserts the packet-get signal (PGET) to advance the next packet 
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to the head of the packet queue. 

In response to the packet-get signal, packet C becomes the new live 
packet during Cycle3 of the host clock 16 and the packet-available signal 
remains asserted. Because chunks A3, BO and Bl will be consumed prior to 
5 the next load operation, chunks B2 and B3 are selected from the hold logic 
(packet B is no longer live) and steered to the LaneO and Lanel outputs, 
respectively. Because the bubble vector associated with live packet C 
indicates that no null chunks are to be prepended to packet C, chunks CO and 
CI of the live packet C are steered to the Lane2 and Lane3 outputs. Further, 

10 because the next memory clock count is three, the push state machine 

determines that chunk CO of packet C (as well as chunks B2 and B3 of packet 
B) will be consumed before subsequent load. Consequently, the push state 
machine asserts the packet-get signal to advance the next packet to the head 
of the packet queue. 

15 In response to the packet-get signal , packet D becomes the new live 

packet in Cycle4 of the host clock 16 and the packet-available signal remains 
asserted. Because chunks B2, B3 and CO will be consumed prior to 
subsequent load, chunks CI, C2 and C3 are selected from the hold logic 
(packet C is no longer live) and steered to the LaneO, Lanel and Lane2 

20 outputs, respectively. Because the bubble vector associated with live packet D 
indicates that no null chunks are to be prepended to packet D, chunk DO of 
the live packet D is steered to the Lane3 output. The next memory clock 
count remains at three, so that the push state machine determines that 
chunks CI, C2 and C3 will be consumed before subsequent load, but not 

25 chunk DO. Consequently, the push state machine deasserts the packet-get 
signal (shown by !PGET in Fig. 11) so that packet D remains live. Thus, 
during CycleS of the host clock 16, packet D remains at the head of the packet 
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queue and chunks D0-D3 are output at Lane0-Lane3, respectively, later in 
cycle 5. 

Fig. 12 illustrates another exemplary shifting of packet chunks through 
the output lanes of the steering logic at the 3:1 memory clock 17 to host clock 
5 16 ratio, but in this example, the bubble vectors for packets B and D are 
nonzero. 

During Cyclel of the host clock 16, live packet A is steered to the 
output lanes by the steering logic. The packet-available signal is asserted 
because the packet queue contains a packet, and the packet-get signal is 
10 asserted because chunks AO, Al and A2 will be consumed before subsequent 
ft load (next memory clock count is three in each cycle of this example). 
t| In response to the packet-get signal , packet B becomes the new live 

ig packet in Cycle2 of the host clock 16 and the packet-available signal remains 
fi asserted. Unconsumed chunk A3 is selected from the hold logic (packet A is 

15 no longer live) and steered to the LaneO output. Because the bubble vector 
J associated with packet B indicates that three null chunks are to be prepended 
!; to packet B, the Lanel, Lane2 and Lane3 outputs are nulled. These null 
3 chunks are indicated by the labels BUBO, BUB1, and BUB2 in Fig. 12. Because 
no chunks from packet B will be consumed before subsequent load, the 
20 packet-get signal is deasserted in Cycle2 of the host clock 16 so that packet B 
remains live in Cycle3. Because the next memory clock count is three, 
chunks A3, BUBO and BUB1 will be consumed before subsequent load, but 
not BUB2. Consequently, during Cycle3 of the host clock 16, BUB2 (i.e., a 
null chunk) is output at LaneO, and chunks BO, Bl and B2 of live packet B are 
25 output at Lanel, Lane2 and Lane3, respectively. Because chunks BO and Bl of 
packet B will be consumed before subsequent load, the packet-get signal is 
asserted to advance packet C to the head of the queue in Cycle4. The bubble 
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vector associated with packet C is zero, indicating that no null chunks are to 
be prepended to packet C Consequently, the unconsumed chunks of packet 
B (i.e., B2 and B3) are steered from respective storage elements in the hold 
logic to the LaneO and Lanel outputs, while chunks CO and CI from live 
5 packet C are steered to the lower priority Lane2 and Lane3 outputs. Because 
chunks B2, B3 and CO will be consumed before subsequent load, the packet- 
get signal is asserted to advance the next packet to the head of the packet 
queue and shifting of chunks through the output lanes of the steering logic 
83 continues. 

10 Fig. 13 illustrates the operation of the row push controller 27 when the 

O frequency ratio of the memory clock 17 to the host clock 16 is 8:3. In this 
W clock ratio, a varying number of packet chunks are consumed per host clock 
*f 16. During Cyclel of the host clock 16, packet A appears at the head of the 
;jp packet queue with a bubble vector of three. Assuming that there are no 
, s 15 chunks from a preceding packet that remain to be output, null chunks BUBO, 
{a BUBl and BUB2 are output on LaneO, Lanel and Lane2 of the steering logic, 

and AO of live packet A is output on Lane3. The next memory clock count 
JjJ (NC) is two, so that null chunks BUBO and BUBl will be consumed before 

subsequent load, but not chunks BUB2 or AO. Because none of the packet A 
20 chunks will be consumed before subsequent load, the packet-get signal is 

deasserted so that packet A remains the live packet during Cycle2 of the host 

clock 16. 

In Cycle2, null chunk BUB2 is output on LaneO of the steering logic 
and chunks AO, Al and A2 from live packet A are output on Lanel, Lane2 
25 and Lane3, respectively. The next memory clock count is three, so that 
chunks BUB2, AO and Al will be consumed before subsequent load. 
Consequently, the packet-get signal is asserted to advance packet B to the 

Atty. Docket No. 42390.P6357 -32- 



head of the packet queue for Cycle3 of the host clock 16. 

During Cycle3, the unconsumed chunks from packet A (i.e., A2 and 
A3) are steered from respective storage elements in the hold logic to the 
LaneO and Lanel outputs of the steering logic. Because the bubble vector 
associated with live packet B is zero, chunks BO and Bl from packet B are 
output at Lane2 and Lane3, respectively. The next memory clock count is 
three, so that chunks A2, A3 and BO will be consumed before subsequent 
load. Consequently, the packet-get signal is asserted to advance packet C to 
the head of the packet queue for Cycle4 of the host clock 16. 

During Cycle4, the unconsumed chunks from packet B (i.e., Bl, B2 and 
B3) are steered from respective storage elements in the hold logic to the 
LaneO, Lanel and Lane2 outputs. Because the bubble vector associated with 
live packet C is zero, chunk CO from packet C is output at Lane3. The next 
memory clock count is two, so that chunks Bl and B2, but not B3 or CO, will 
be consumed before subsequent load. Because B3 was obtained from the hold 
logic during Cycle4, but will be needed for yet another cycle of the host clock 
16, B3 is recirculated in the hold logic 85. Because none of the chunks of 
packet C will be consumed before subsequent load, the packet-get signal is 
deasserted during Cycle4 so that packet C will remain live during CycleS of 
the host clock 16. 

In CycleS, the recirculated chunk B3 is steered from the hold logic to 
the LaneO output of the steering logic, and chunks CO, CI and C2 are steered 
from the head of the packet queue to the Lanel, Lane2 and Lane3 outputs, 
respectively. The next memory clock count is three, so that chunks B3, CO 
and CI will be consumed before subsequent load. Consequently, the packet- 
get signal is asserted to advance a new packet to the head of the packet queue 
for the next cycle of the host clock 16 and rotation of packet chunks through 
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the output lanes continues. 

Fig. 14 illustrates a live bypass operation. During Cyclel of the host 
clock 16, packet A is the live packet and chunks A0-A3 are output at LaneO- 
Lane3 of the steering logic. The packet-available signal is asserted to indicate 
5 that a packet is present in the packet queue. The next memory clock count is 
three, so that chunks AO, Al and A2 will be consumed before subsequent 
load. Consequently, the packet-get signal is asserted to advance packet B to 
the head of the packet queue for Cycle2 of the host clock 16. As shown, the 
bypass signal is deasserted (1BYPASS) during Cyclel. (Although the state of 
10 the bypass signal was not explicitly shown in Figs. 10, 11 and 12, the bypass 
■k§ signal was assumed to be deasserted throughout those examples). 
m In Cycle2 of the host clock 16, previously loaded, but unconsumed 

•fj chunk A3 is steered from the hold logic to LaneO of the steering logic, and 
JJ chunks BO, Bl and B2 are steered from the packet queue to the Lanel, Lane2 

15 and Lane3 outputs, respectively. The packet-available signal remains 
W asserted because packet B is present at the head of the packet queue. The next 
# memory clock count is three, so that chunks A3, BO and Bl will be consumed 
O before subsequent load. Consequently, the packet-get signal is asserted to 

advance the next packet to the head of the packet queue for Cycle3 of the host 
20 clock 16. The bypass signal remains deasserted during Cycle2. 

Assuming that packet B was the last packet in the packet queue, then 
the assertion of the packet-get signal during Cycle2 of the host clock 16 causes 
the packet queue to become empty during Cycle3. Consequently, the packet- 
available signal is deasserted to inform the push state machine that there is 
25 no packet at the head of the packet queue. The push state machine responds 
to the deassertion of the packet-available signal by steering the unconsumed 
chunks of packet B (i.e., B2 and B3) to LaneO and Lanel of the steering logic, 
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and by steering null chunks BUBO and BUB 1 to Lane2 and Lane3. Because 
the next memory clock count is two, chunks B2 and B3 will be consumed 
before subsequent load, but not the null chunks BUBO and BUB1. The 
packet-get signal remains asserted to indicate the need for a live packet. The 
bypass signal remains deasserted. 

During Cycle4 of the host clock 16, the packet-available signal remains 
deasserted and the push state machine responds by outputting null chunks 
(i.e., BUB0-BUB4) on all four lanes of the steering logic. The next memory 
clock count is three, so that three null chunks (BUBO, BUB1 and BUB2) will 
be consumed before subsequent load. Note that, despite the insertion of null 
chunks, a steady stream of chunks continues to be output to the memory- 
side control logic. The packet-get signal remains asserted and the bypass 
signal remains deasserted during Cycle4. 

During Cycle5 of the host clock 16, the packet-available signal remains 
deasserted, but the bypass signal is asserted to indicate that a packet is 
available on the live bypass path. The push state machine responds to the 
bypass signal by steering chunks C0-C3 of packet C from the bypass path to the 
Lane0-Lane3 outputs of the steering logic. The next memory clock count is 
three, so that chunks CO, CI and C2 will be consumed before subsequent load. 
Chunk C3 is stored in the hold logic. The packet-get signal remains asserted 
to indicate that a new live packet is needed. 

In cycle6 of the host clock 16, packet D is made available at the head of 
the packet queue 63, so that the packet-available signal is asserted. Chunk C3 
from packet C will not be consumed before the output chunk set of cycle 6 is 
loaded. Consequently, chunk C3 is steered from the hold logic to LaneO of 
the steering logic, while chunks DO, Dl and D2 are steered from the head of 
the packet queue to the Lanel, Lane2 and Lane3 outputs, respectively. The 
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next memory clock count is two, so that chunks C3 and DO will be consumed 
before subsequent load. Consequently, the packet-get signal is asserted to 
advance a new packet, if available, to the head of the packet queue. The 
bypass signal is deasserted to indicate the absence of a packet on the live 
bypass path. 

In the foregoing specification, the invention has been described with 
reference to specific exemplary embodiments thereof. It will, however, be 
evident that various modifications and changes may be made to the specific 
exemplary embodiments without departing from the broader spirit and scope 
of the invention as set forth in the appended claims. Accordingly, the 
specification and drawings are to be regarded in an illustrative rather than a 
restrictive sense. 
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CLAIMS 

What is claimed is: 



1 1. A method of transferring units of information between clock domains 

2 in a memory system, the method comprising: 

3 loading a respective set of N units of information from an output circuit 

4 in a first clock domain into a storage circuit in a second clock 

5 domain during each clock cycle of the first clock domain; and 

6 the output circuit selecting each respective set of N units of information 

7 to include 

8 units of information that have previously been loaded into the 

9 storage circuit and that will not be output from the storage 

10 circuit prior to the storage circuit being loaded with a 

1 1 subsequent set of N units of information, and 

12 a complement number of units of information that have not 

13 previously been loaded into the storage circuit. 

1 2. The method of claim 1 wherein the storage circuit is a shift circuit and 

2 wherein units of information are shifted out of the shift circuit in 

3 response to transitions of a clock signal in the second clock domain. 

1 3. The method of claim 1 further comprising generating a count value that 

2 indicates how many units of information M of each set of N units 

3 loaded into the storage circuit will be output from the circuit prior to the 

4 storage circuit being loaded with a subsequent set of N units of 

5 information, wherein the complement number of units of information 
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6 



includes M units of information. 



1 4. The method of claim 3 wherein the count value is regenerated for each 

2 clock cycle of the first clock domain. 

1 5. The method of claim 4 wherein the count value remains constant over 

2 time if the frequency of the second clock domain is an integer multiple 

3 of the frequency of the first clock domain. 

1 6. The method of claim 4 wherein the count value varies over time if the 

2 frequency of the second clock domain is not an integer multiple of the 

3 frequency of the first clock domain. 

1 7. The method of claim 3 wherein M is determined based on relative clock 

2 frequencies of first clock domain and the second clock domain. 

1 8. The method of claim 1 wherein selecting each respective set of N units 

2 of information comprises selecting units of information from one or 

3 more of a plurality of information sources. 

1 9. The method of claim 8 wherein selecting units of information from one 

2 or more of a plurality of information sources comprises selecting units 

3 of information from one or more of an information queue, a set of hold 

4 registers and a source of null data. 

1 10. The method of claim 9 wherein the plurality of information sources 

2 further comprises a bypass source that bypasses the information queue. 
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11. The method of claim 1 wherein selecting each respective set of N units 
of information comprises issuing select signals to steering logic 83 to 
select a respective one of a plurality of sources to supply each unit of 
information in each respective set of N units of information. 

12. A method of transferring units of information between clock domains 
in a memory system, the method comprising: 

loading a first set of N units of information from an output circuit in a 
first clock domain into a shift circuit in a second clock domain 
during a first clock cycle of the first clock domain; 

generating a count value, the count value indicating how many units of 
information M of the first set of N units of information will be 
shifted out of the shift circuit prior to the shift circuit being loaded 
with a second set of N units of information in a second clock cycle 
of the first clock domain; and 

loading the second set of N units of information from the output circuit 
into the shift circuit during the second clock cycle, the second set of 
N units including M units of information not previously loaded 
into the shift circuit and N-M units of information from the first 
set of N units of information. 
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ABSTRACT OF THE DISCLOSURE 

An apparatus and method for transferring units of information 
between clock domains. A respective set of N units of information is loaded 
from an output circuit in a first clock domain into a storage circuit in a 
second clock domain during each cycle of the first clock domain. Each set of 
N units is selected by the output circuit to include (1) units of information 
that have previously loaded into the storage circuit and that will not be 
output from the storage output from the storage circuit prior to the storage 
circuit being loaded with a subsequent set of N units of information, and (2) a 
complement number of units of information that have not previously been 
loaded into the storage circuit. 
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Attorney's Docket No.: 042390. P6357 PATENT 

DECLARATION AND POWER OF ATTORNEY FOR PATENT APPLICATION 
(FOR INTEL CORPORATION PATENT APPLICATIONS) 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below, next to my name. 

I believe I am the original, first, and sole inventor (if only one name is listed below) or an 
original, first, and joint inventor (if plural names are listed below) of the subject matter 
which is claimed and for which a patent is sought on the invention entitled 

CROSS-CLOCK DOMAIN DATA TRANSFER METHOD AND APPARATUS 

the specification of which 

is attached hereto. 

XXX was filed on November 3. 1998 as 

United States Application Number 09/1 86.046 

or PCT international Application Number 

and was amended on . 

(if applicable) 

I hereby state that I have reviewed and understand the contents of the above-identified 
specification, including the claim(s), as amended by any amendment referred to above. I 
do not know and do not believe that the claimed invention was ever known or used in the 
United States of America before my invention thereof, or patented or described in any 
printed publication in any country before my invention thereof or more than one year 
prior to this application, that the same was not in public use or on sale in the United States 
of America more than one year prior to this application, and that the invention has not 
been patented or made the subject of an inventor's certificate issued before the date of this 
application in any country foreign to the United States of America on an application filed 
by me or my legal representatives or assigns more than twelve months (for a utility 
patent application) or six months (for a design patent application) prior to this 
application. 

I acknowledge the duty to disclose all information known to me to be material to 
patentability as defined in Title 37, Code of Federal Regulations, Section 1 .56. 

I hereby claim foreign priority benefits under Title 35, United States Code, Section 
119(a)-(d), of any foreign application(s) for patent or inventor's certificate listed 
below and have also identified below any foreign application for patent or inventor's 
certificate having a filing date before that of the application on which priority is claimed: 
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Prior Foreign Application(s) 



Priority 
Claimed 



(Number) (Country) (Day/Month/Year Filed) Yes No 



(Number) (Country) (Day/Month/Year Filed) Yes No 



(Number) (Country) (Day/Month/Year Filed) Yes No 

I hereby claim the benefit under title 35, United States Code, Section 119(e) of any United 
States provisional application(s) listed below 



(Application Number) Filing Date 



(Application Number) Filing Date 



I hereby claim the benefit under Title 35, United States Code, Section 120 of any United 
States application(s) listed below and, insofar as the subject matter of each of the claims 
of this application is not disclosed in the prior United States application in the manner 
provided by the first paragraph of Title 35, United States Code, Section 112, I 
acknowledge the duty to disclose all information known to me to be material to 
patentability as defined in Title 37, Code of Federal Regulations, Section 1 .56 which 
became available between the filing date of the prior application and the national or PCT 
international filing date of this application: 



(Application Number) Filing Date (Status -- patented, 

pending, abandoned) 



(Application Number) Filing Date (Status patented, 

pending, abandoned) 
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I hereby appoint Farzad E. Amini, Reg. No. P42,261; Aloysius T. C. AuYeung, Reg. No. 35,432; 

Amy M. Armstrong, Reg. No. P42,265; William Thomas Babbitt, Reg. No. 39,591; Carol F. Barry, 

Reg. No. P41,600; Jordan Michael Becker, Reg. No. 39,602; Bradley J. Bereznak, Reg. No. 33,474; 

Michael A. Bernadicou, Reg. No. 35,934; Roger W. Blakely, Jr., Reg. No. 25,831; Gregory D. 

Caldwell, Reg. No. 39,926; Kent M. Chen, Reg. No. 39,630; Lawrence M. Cho, Reg. No. 39,942; 

Yong S. Choi, Reg. No. P43,324; Thomas M. Coester, Reg. No. 39,637; Roland B. Cortes, Reg. No. 

39,152; Barbara Bokanov Courtney, Reg. No. P42,442; Michael Anthony DeSanctis, Reg. No. 

39,957; Daniel M. De Vos, Reg. No. 37,813; Robert Andrew Diehl, Reg. No. 40,992; Tarek N. 

Fahmi, Reg. No. 41,402; James Y. Go, Reg. No. 40,621; Richard Leon Gregory, Jr., P42,607; 

Dinu Gruia, Reg. No. P42,996; David R. Halvorson, Reg. No. 33,395; Thomas A. Hassing, Reg. 

No. 36,159; Phuong-Quan Hoang, P41,839; Wilimore F. Holbrow III, Reg. No. P41 f 845; 

George W Hoover II, Reg. No. 32,992; Eric S. Hyman, Reg. No. 30,139; Dag H. Johansen, Reg. 

No. 36,172; William W. Kidd, Reg. No. 31,772; Michael J. Mallie, Reg. No. 36,591; Andre L. 

Marais, under 37 C.F.R. § 10.9(b); Paul A. Mendonsa, Reg. No. P42,879; Darren J. Milliken, 

P42,004; Thinh V. Nguyen, P42,034; Kimberley G. Nobles, Reg. No. 38,255; Michael A. 

Proksch, Reg. No. P43,021; Babak Redjaian, P42,096; James H. Salter, Reg. No. 35,668; 
O William W. Schaal, Reg. No. 39,018; James C. Scheller, Reg. No. 31,195; Anand Sethuraman, 

Reg. No. P43,351; Charles E. Shemweli, Reg. No. 40,171; Maria McCormack Sobrino, Reg. No. 
Ifl 31,639; Stanley W. Sokoloff, Reg. No. 25,128; Allan T. Sponseller, Reg. No. 38,318; Judith A. 
Ill Szepesi, Reg. No. 39,393; Vincent P. Tassinari, Reg. No. P42,179; Edwin H. Taylor, Reg. No. 
,jp 25,129; George G. C. Tseng, Reg. No. 41,355; Lester J. Vincent, Reg. No. 31,460; John 
ajg Patrick Ward, Reg. No. 40,216; Stephen Warhola, Reg. No. P43,237; Ben J. Yorks, Reg. No. 
:! §g 33,609; and Norman Zafman, Reg. No. 26,250; my attorneys, of BLAKELY, SOKOLOFF, TAYLOR 
flj & ZAFMAN LLP, with offices located at 12400 Wiishire Boulevard, 7th Floor, Los Angeles, 
\~ California 90025, telephone (310) 207-3800, and Alan K. Aldous, Reg. No. 31,905; Robert 

D. Anderson, Reg. No. 33,826; Joseph R. Bond, Reg. No. 36,458; Richard C. Caiderwood, Reg. 

No- 35,468; Cynthia Thomas Faatz, Reg No. 39,973; Sean Fitzgerald, Reg. No. 32,027; Seth Z. 
|;f Kalson, Reg. No. 40,670; David J. Kaplan, Reg. No. 41,105; Leo V. Novakoski, Reg. No. 37,198; 
% Naomi Obinata, Reg. No. 39,320; Thomas C. Reynolds, Reg. No. 32,488; Steven P. Skabrat, Reg. 
J No. 36,279; Howard A. Skaist, Reg. No. 36,008; Steven C. Stewart, Reg. No. 33,555; Raymond 
]!g J. Werner, Reg. No. 34,752; and Charles K. Young, Reg. No. 39,435; my patent attorneys, and 

Jeffrey S. Draeger, Reg. No. 41,000; Thomas Raleigh Lane, Reg. No. P42,781; Calvin E. Wells, 

Reg. No. P43,256; and Alexander Ulysses Witkowski, Reg. No. P43,280; my patent agents, of 

INTEL CORPORATION; and James R. Thein, Reg. No. 31,710, my patent attorney; with full power 

of substitution and revocation, to prosecute this application and to transact all business in the 

Patent and Trademark Office connected herewith. 



Send correspondence to Charles E. Shemweli , BLAKELY, SOKOLOFF, TAYLOR & 

(Name of Attorney or Agent) 
ZAFMAN LLP, 12400 Wiishire Boulevard, 7th Floor, Los Angeles, California 90025 and 
direct telephone calls to Charles E. Shemweli (408) 720-8598. 
(Name of Attorney or Agent) 
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I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 



Full Name of Sole/First Inventor David J. McDonnell 



Inventor's Signature /^^^^^^ * WhoHttJ^ Date To^ , 1^ 
Residence Fair Oaks. California Citizenship Canada 



(City, State) (Country) 



Post Office Address 9137 Shady Hollow Way 



Fair Oaks. California 95628 



Full Name of Sole/First 



e/First Itajtentor Andrew M.A/olk 
ite Bay. California 



Inventor's Signature /f^du<x) _ Date Jaa f ; (iff 



Residence Granite Bay. California Citizenship U.S.A. 



(City, State) (Country) 

Post Office Address 7380 Sierra Ponds Lane 

Granite Bay, CA 95746 



Full Name of Sole/First Inventor Michael W. Williams 

Inventor's Signature Date ' 

Residence Citrus Heights. California Citizenship USA 



(City, State) (Country) 

Post Office Address 8386 Zancanaro Court 

Citrus Heights. California 95610 
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Title 37, Code of Federal Regulations, Section 1.56 
Duty to Disclose Information Material to Patentability 



(a) A patent by its very nature is affected with a public interest. The public interest is best served, 
and the most effective patent examination occurs when, at the time an application is being examined, the 
Office is aware of and evaluates the teachings of all information material to patentability. Each individual 
associated with the filing and prosecution of a patent application has a duty of candor and good faith in dealing 
with the Office, which includes a duty to disclose to the Office all information known to that individual to be 
material to patentability as defined in this section. The duty to disclosure information exists with respect to 
each pending claim until the claim is cancelled or withdrawn from consideration, or the application becomes 
abandoned. Information material to the patentability of a claim that is cancelled or withdrawn from 
consideration need not be submitted if the information is not material to the patentability of any claim remaining 
under consideration in the application. There is no duty to submit information which is not material to the 
patentability of any existing claim. The duty to disclosure all information known to be material to patentability 
is deemed to be satisfied if all information known to be material to patentability of any claim issued in a patent 
was cited by the Office or submitted to the Office in the manner prescribed by §§1 .97(b)-(d) and 1 .98. 
However, no patent will be granted on an application in connection with which fraud on the Office was practiced 
or attempted or the duty of disclosure was violated through bad faith or intentional misconduct. The Office 
encourages applicants to carefully examine: 

(1) Prior art cited in search reports of a foreign patent office in a counterpart application, and 

(2) The closest information over which individuals associated with the filing or prosecution of a 
patent application believe any pending claim patentably defines, to make sure that any material information 
contained therein is disclosed to the Office. 

(b) Under this section, information is material to patentability when it is not cumulative to 
information already of record or being made or record in the application, and 

(1) It establishes, by itself or in combination with other information, a prima facie case of 
unpatentability of a claim; or 

(2) It refutes, or is inconsistent with, a position the applicant takes in: 

(i) Opposing an argument of unpatentability relied on by the Office, or 

(ii) Asserting an argument of patentability. 

A prima facie case of unpatentability is established when the information compels a conclusion that a claim is 
unpatentable under the preponderance of evidence, burden-of-proof standard, giving each term in the claim its 
broadest reasonable construction consistent with the specification, and before any consideration is given to 
evidence which may be submitted in an attempt to establish a contrary conclusion of patentability. 

(c) Individuals associated with the filing or prosecution of a patent application within the 
meaning of this section are: 

(1) Each inventor named in the application; 

(2) Each attorney or agent who prepares or prosecutes the application; and 

(3) Every other person who is substantively involved in the preparation or prosecution of the 
application and who is associated with the inventor, with the assignee or with anyone to whom there is an 
obligation to assign the application. 

(d) Individuals other than the attorney, agent or inventor may comply with this section by 
disclosing information to the attorney, agent, or inventor. 
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